Math Indexer and Searcher under the Hood: History and Development of a Winning Strategy
نویسندگان
چکیده
This paper describes and summarizes experience of Masaryk University Math Information Retrieval team (MIRMU) with the mathematical search developed and performed for the NTCIR-11 Math-2 Task. Our approach is the similarity search based on canonicalized MathML and second generation of scalable full text search engine Math Indexer and Searcher (MIaS) with attested state-of-the-art information retrieval techniques like query expansion. The capability of MIaS system in terms of math query notation, normalization and combining math with textual query tokens was deployed by submitting multiple runs with four query notations provided, and with results merged from multiple queries. The analysis of the evaluation results shows that the system performs best using TEX queries that are translated and canonicalized to Content MathML, where MIaS ranked as #1 for all metrics returning very relevant results.
منابع مشابه
Math Indexer and Searcher under the Hood: Fine-tuning Query Expansion and Unification Strategies
This paper summarizes the experience of Math Information Retrieval team of Masaryk University (MIRMU) with the NTCIR-12 MathIR arXiv Main Task and its subtasks. We based our approach on the MIaS system. Based on NTCIR-11 Math-2 Task relevance judgements, we developed an evaluation platform. Using this platform we rigorously evaluated combinations of new features and picked the most promising on...
متن کاملIndexing and Searching Mathematics in Digital Libraries
This paper surveys approaches and systems for searching mathematical formulae in mathematical corpora and on the web. The design and architecture of our MIaS (Math Indexer and Searcher) system is presented, and our design decisions are discussed in detail. An approach based on Presentation MathML using a similarity of math subformulae is suggested and verified by implementing it as a math-aware...
متن کاملSimilarity Search for Mathematics: Masaryk University Team at the NTCIR-10 Math Task
This paper describes and summarizes experiences of Masaryk University team MIRMU with the mathematical search performed for the NTCIR pilot Math Task. Our approach is the similarity search based on enhanced full text search utilizing attested state-of-the-art techniques and implementations. The variability of used Math Indexer and Searcher (MIaS) system in terms of the math query notation was t...
متن کاملIdentifying the pattern of the talent management as the winning strategy of the organization; A study in the National Iranian South Oil Company
This study was conducted to identify the dimensions, components and indices of the talent management in the National Iranian South Oil Company. The study has been considered an applied research in terms of its purpose and in terms of data was qualitative and it has been done based on grounded theory in terms of the nature of the implementation. The statistical population of the study was expe...
متن کاملA GA Model Development for Decision Making Under Reverse Logistics
Managing products’ end-of-life and recovery of used products is gaining significant importance during last years. Therefore, managing the reverse flow of products can be an important potential for winning consumers in future competitive markets. In this context, establishing reverse logistics networks is becoming a main problem in reverse supply chains. Genetic Algorithm (GA) is utilized to s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014